Tackling CASMI 2012: Solutions from MetFrag and MetFusion
Authors
Abstract
The task in category 2 of the Critical Assessment of Small Molecule Identification (CASMI) contest was to determine the identity of (initially) unknown compounds for which high-resolution tandem mass spectra were published. We focused on computer-assisted methods that attempt to identify the compound automatically, and entered the contest with MetFrag and MetFusion to score candidate structures retrieved from the PubChem structure database. MetFrag was combined with the metabolite-likeness score, which helped to improve the performance for the natural product challenges. We present the results, discuss the performance, and give details of how to interpret the MetFrag and MetFusion output.
Similar resources
The Critical Assessment of Small Molecule Identification (CASMI): Challenges and Solutions
The Critical Assessment of Small Molecule Identification, or CASMI, contest was founded in 2012 to provide scientists with a common open dataset to evaluate their identification methods. In this article, the challenges and solutions for the inaugural CASMI 2012 are presented. The contest was split into four categories corresponding with tasks to determine molecular formula and molecular structu...
CASMI—The Small Molecule Identification Process from a Birmingham Perspective
The Critical Assessment of Small Molecule Identification (CASMI) contest was developed to provide a systematic comparative evaluation of strategies applied for the annotation and identification of small molecules. The authors participated in eleven challenges in both category 1 (to deduce a molecular formula) and category 2 (to deduce a molecular structure) related to high resolution LC-MS data...
Metabolite Identification through Machine Learning — Tackling CASMI Challenge Using FingerID
Metabolite identification is a major bottleneck in metabolomics due to the number and diversity of the molecules. To alleviate this bottleneck, computational methods and tools that reliably filter the set of candidates are needed for further analysis by human experts. Recent efforts in assembling large public mass spectral databases such as MassBank have opened the door for developing a new gen...
CASMI - A visualization tool for the World Stress Map database
The World Stress Map (WSM) project has compiled a global database of quality-ranked data records on the contemporary tectonic stresses in the Earth’s crust. The WSM 2005 database release contains approximately 16 000 data records from different types of stress indicators such as earthquake focal mechanisms solutions, well bore breakouts, hydraulic fracturing and overcoring measurements, as well...
New kids on the block: novel informatics methods for natural product discovery.
Covering: 2008 to 2014. Mass spectrometry is a key technology for the identification and structural elucidation of natural products. Manual interpretation of the resulting data is tedious and time-consuming, so methods for automated analysis are highly sought after. In this review, we focus on four recently developed methods for the detection and investigation of small molecules, namely MetFrag/...